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VECTORS DERIVED FROM PLASMIDS 
Introduction to Plasmid Biology 



Plasmids are self-replicating, extrachro- 
mosomal DNA molecules found in virtually all 
bacterial species. In nature, plasmids occur in 
exuberant profusion, varying in structure, size, 
mode of replication, number of copies per bac- 
terial cell, ability to propagate in different bac- 
teria, transferability between bacterial species, 
and perhaps most important, in the traits they 
carry. Most prokaryotic plasmids are double- 
stranded circular DNA molecules; however, 
linear plasmids have been identified in both 
gram-positive and gram-negative bacteria. The 
size of plasmids varies widely, from several 
kilobases to hundreds of kilobases. Replication 
of plasmids depends on host-cell proteins but 
also may require pi asm id -encoded functions. 
Plasmid replication may be synchronized with 
the bacterial cell cycle, resulting in a low num- 
ber of plasmid molecules per bacterial cell, or 
independent of the host cell cycle, allowing for 
the proliferation of hundreds of plasmid copies 
per cell. Some plasmids freely transfer their 
DNA across bacterial species, some only trans- 
fer their DNA into bacteria of the same species, 
and some do not transfer their DNA at all. 
Plasmids carry genes that specify a wide variety 
of functions including: resistance to antibiotics, 
resistance to heavy metals, sensitivity to mu- 
tagens, sensitivity or resistance to bacterio- 
phages, production of restriction enzymes, pro- 
duction of rare amino acids, production of tox- 
ins, determination of virulence, catabolism of 
complicated organic molecules, ability to form 
symbiotic relationships, and ability to transfer 
DNA across kingdoms. 

Starting in the 1970s, vectors for propaga- 
tion, manipulation, and delivery of specific 
DNA sequences were constructed with fi-ag- 
ments from naturally occurring plasmids, pri- 
marily Escherichia coli plasmids. All plasmid 
vectors contain three common features: a rep- 
licator, a selectable marker, and a cloning site. 
The replicator is a stretch of DNA that contains 
the site at which DNA replication begins (the 
origin of replication or ori\ and that also in- 
cludes genes encoding whatever plasmid-en- 
coded RNAs and proteins are necessary for 
replication. The selectable marker, necessary 
for following and maintaining the presence of 
the plasmid in cells, is usually dominant and is 
usually a gene encoding resistance to some 



antibiotic. The cloning site is a restriction en- 
donuclease cleavage site into which foreign 
DNA can be inserted without interfering with 
the plasmid's ability to replicate or to confer 
the selectable phenotype on its host. Over the 
years, plasmid vectors have increased in so- 
phistication; in addition to these three basic 
elements many vectors now include features 
that make them particularly suitable for specific 
types of experiments. 

UNITS /.6<fe 7.7 describe procedures for making 
plasmid DNA. The process by which plasmids 
are introduced into E, coli is called transfor- 
mation. Transformation protocols are given in 
UNIT L8. For a list of vectors and their salient 
features see appendix 5. 

REPLICATORS 

One of the ways that replicators are classi- 
fied is based on the number of plasmid mole- 
cules maintained per bacterial cell under some 
set of standard growth conditions, the so-called 
copy number of the plasmid. This book defines 
high-copy-number plasmids as those which 
exist in >20 copies per bacterial cell grown in 
liquid LB medium, and low-copy-number 
plasmids as those which exist in <20 copies per 
cell. High-copy-number-plasmids tend to be 
smaller than low-copy-number plasmids. They 
are more commonly used in molecular biologi- 
cal techniques since it is easier to prepare large 
quantities of pure plasmid DNA from cells that 
bear them. Low-copy-number plasmids are 
utilized when it is important to control the gene 
dosage of a cloned sequence, for example when 
the DNA sequence or the protein it encodes 
makes the host organism sick (see Table 1.5. 1 ). 

High-copy-number plasmids tend to be un- 
der relaxed control of replication. These re- 
laxed plasmids initiate DNA replication in a 
process controlled by plasmid-encoded func- 
tions (see Mechanism of Replication and Copy- 
Number Control), and replication does not de- 
pend on the unstable host replication initiation 
proteins synthesized at the start of the bacterial 
cell cycle. Because their replication depends 
only on the stable host enzymatic replication 
machinery, the copy number of these plasmids 
can be greatly increased, or amplified, by treat- 
ment of the plasmid-containing cells with pro- 
tein synthesis inhibitors such as chlorampheni- 
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Table 1,5.1.' Characteristics of Commonly Used Plasmid Replicators 



Replicator 


Prototype 
plasmid 


Size (bp) 


Markers on 
prototype 


Cody" 
number 


References 


pMBl 


pBR322 


4,362 


Ap^, Tet^ 




rsoiivar ei ai., ly// 


ColEl 


pMK16 


-4,500 


Kan^ Tet^ 
ColEl*"^ 


High- >1 S 


T^nhn e^f ol 1090 

ivann ei ai., ly/y 


pl5A 


pACYC184 


-4,000 


EmF, Tet^ 


High; -15 


Chang et al., 1978 


pSClOl 


pLG338 


-^7,300 




Low J — 6 


olOKer et ai., ivoz 


F 


pDF41 


-12,800 


TrpE 


T nw 1 tn 9 


jvann ei ai., ly/y 


R6K 


dRK353 


-11,100 


TmF 


L<UW, J 


VoKm q*- ol 1 mo 
Kann et ai., ly /y 


Rl (R\drd-17) 


pBEU50 


-10,000 


Ap^ Tet^ 


Low at 30°C; high 
above 35°C* 


Uhlin etal., 1983 


RK2 


pRK2501 


-11,100 


Kan^ Tet^ 


Low; 2 to 4 


Kahnetal., 1979 


Xdv 


X dvgal 


c 


Gal 




Jackson etal., 1972 



''Copy numbers are for the prototype plasmid. Plasmid vectors that contain replicators derived from these plasmids may have different 
copy numbers due to introduction of mutations into the replicator. For example, pUC series (pmBl -derived) has copy numbers of 
1000-3000. 

temperature sensitive. 
'^Not known. 
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col or spectinomycin. These protein synthesis 
inhibitors prevent synthesis of the replication- 
initiation proteins required for chromosomal 
replication, allowing the replication enzymes 
to be commandeered for plasmid replication. 
High-copy-number plasmids usually do not 
have any mechanism to ensure correct segrega- 
tion of the plasmids to the daughter cells; they 
rely on random assortment to partition at least 
one copy of the plasmid to each daughter. 

Low-copy- number plasmids are usually un- 
der stringent control. Initiation of replication 
of these plasmids depends on unstable proteins 
synthesized at the start of the bacterial cell cycle 
and thus is synchronized with the replication of 
the bacterial chromosome. As copy number 
decreases, random segregation of plasmid cop- 
ies is not sufficient to ensure that each daughter 
cell acquires a copy of the plasmid. However, 
most low-copy-number plasmids carry genes 
that guarantee their maintenance in the bacte- 
rial population. Stabilizing loci that encode a 
mechanism for postsegregational killing of 
plasmid-free daughter cells and a system for 
efficiently resolving plasmid multimers so that 
they can be appropriately partitioned have been 
identified. In addition, cis- and rra/i^-acting 
partitioning loci (par) that are thought to con- 
stitute an active mechanism for distribution of 
plasmid copies to daughter cells have also been 
identified. 



MECHANISM OF REPLICATION 
AND COPY-NUMBER CONTROL 

While there are many different replicators, 
the majority of plasmid vectors used in routine 
recombinant DNA work contain one of the 
functionally similar replicators derived from 
plasmids ColEl or pMBl (for example, the 
popular pBluescript series contain a ColEl ori- 
gin, and the pUC plasmids are derived from 
pMB 1). The ColEl replicator is a 600-nucleo- 
tide DNA fragment that contains the origin of 
replication (ori\ encodes an RNA primer, and 
encodes two negative regulators of replication 
initiation. All enzymatic functions for replica- 
tion of the plasmid are provided by the bacterial 
host. 

Plasmid replication begins with transcrip- 
tion of an RNA primer upstream of the ori 
(RNAII; see Fig. 1,5.1) by the host RNA po- 
lymerase. RNAII is elongated through and ter- 
minated downstream of the ori. Interaction of 
a specific secondary structure in the nascent 
RNAII transcript with the DNA template re- 
sults in formation of a persistent hybrid be- 
tween RNAII and the DNA template such that 
RNAII remains paired with the DNA template 
at the ori. The RNAII transcript is cleaved by 
RNAseH at the ori sequence. This processed 
RNAII primer is extended by DNA polymerase 
I to initiate plasmid replication. Regulation of 
the proper RNAII secondary structure controls 
initiation of DNA replication and is responsible 
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for determining the number of plasmid mole- 
cules per cell. 

Both ColEl and pMBl plasmids are high- 
copy-number plasmids, maintained at between 
1 5 and 25 copies per bacterial cell, respectively. 
The copy number of these plasmids is regulated 
by an antisense RNA transcript, RNAI, and the 
protein ROP, the product of the rop gene. RNAI 
and the ROP protein act in concert to intercept 
formation of the proper RNAII secondary 



structure. RNAI is exactly complementary to 
the 5' end of the RNAII transcript. RNAI base 
pairs with the 5' end of RNAII, preventing the 
formation of the specific secondary structure 
necessary for establishment of a persistent hy- 
brid between RNAII and the DNA template that 
is a prerequisite for maturation of the RNAII 
primer. ROP protein stabilizes the interaction 
between the antisense RNAI and the primer 
RNAII. Mutations that disrupt or destabilize 



RNAI 



jo 



RNAII (preprimer) 



on 



RNAII 



elongation of RNAII transcription 



no binding 




binding of RNAI 



formation of persistent hybrid at on 
RNAII 



RNAI 



on 



on 



orit 

formation of RNA primer ' 



RNAI/RNAII duplex 



RNAII 



RNase H cleavage 



on 



^ no persistent hybrid formed at on 

"^*S!#!3Ma««- no primer maturation 



initiation of DNA leading-strand synthesis 



Figure 1 .5.1 ColEI replication initiation and copy number control. C0IEI replication requires the 
formation of a RNA primer to initiate DNA synthesis. The RNA primer is derived from processing 
of a transcript, RNAII (bold), that starts upstream of the on. Appropriate processing of the RNAn 
transcnpt is dependent on formation of a persistent hybrid between RNAII and the oriDN A template 
If the proper RNAII secondary structure forms, then the RNA/DNA duplex is maintained at the ori 
" ""^^^"^ *fanscript to generate the primerfor DNA synthesis. Processing 

of RNAII to form the primer is regulated by a second transcript, RNAI. RNAI is complementary to 
the 5' end of RNAII. If the RNAI transcript forms a duplex with RNAII, RNAII cannot take on the 
secondary structure necessary to create the persistent hybrid at the or/ and maturation of the RNAII 
pnmer is inhibited. The copy number of C0IEI plasmids is determined by the balance between 
successful RNAII processing events and those inhibited by RNAI. Adapted from Gerhart et al (1 994) 
with permission from the Annual Review of Microbiology. 
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the RNAI/RNAII interaction result in a higher 
plasmid copy number. For example, the pUC 
plasmid series have pMBl replicators that do 
not contain an intact rop gene, which accounts 
for a 2-fold increase in copy number over plas- 
mids with intact pMB 1 replicons. Interestingly, 
additional mutation(s) in the origin are the main 
factor in the ultra-high-copy number (1000 to 
3000) of this plasmid series (K. Struhl, unpub. 
observ.) 

PLASMID INCOMPATIBILITY 

For experiments that require that more than 
one plasmid vector be maintained in a bacterial 
cell at the same time, another critical feature of 
plasmid replicators is whether or not two plas- 
mid replicators are compatible. Two plasmids 
are said to be incompatible with one another, 
and hence belong to the same incompatibility 
group, if they cannot stably co-exist. Plasmids 
are generially incompatible if they share any 
function required for the regulation of plasmid 
replication. For example, ColEl- and pMBl- 
derived plasmids are incompatible with one 
another but are compatible with pi 5 A plas- 
mids. The incompatibility of ColEl and pMB 1 
plasmids is a consequence of two facts: first, 
that plasmid DNA replication of these plasmids 
is negatively controlled by RNAI which acts in 
trans on other plasmids with the same primer 
RNA, and second, that these plasmids lack a 
mechanism to ensure that each plasmid in a cell 
replicates once per cell cycle. Therefore, if a 
cell that contains a pMBI-derived plasmid is 
subsequently transformed with a CoIEl -de- 
rived plasmid, cells selected to contain the 
ColEl plasmid will usually have lost the pMBI 
plasmid. 

SELECTABLE MARKERS 

In order to guarantee that a plasmid vector 
is taken up by and maintained in bacterial cells, 
there must be a way to select for plasmid-con- 
taining cells. Genes encoding resistance to an- 
tibiotics such as ampicillin, tetracycline, 
kanamycin and chloramphenicol are the most 
common bacterial selectable markers for plas- 
mid vectors. Typically, cells are transformed 
with plasmid DNA using the technique de- 
scribed in UNrr rs and then plated out on LB 
plates that contain the proper antibiotic (see 
recipes in UNrr i.j). Only the bacterial cells 
containing the plasmid will grow on the selec- 
tive medium; the antibiotic-resistance pheno- 
type conferred is dominant to the antibiotic- 
sensitive phenotype of cells that do not possess 
the plasmid vector. Another dominant select- 



able marker that is occasionally used is the 
mmunity to infection by phage lambda 
(lambda repressor). In addirion, recessive 
markers are sometimes used in plasmid selec- 
tions; for example, l€uB~ E. coli cannot grow 
in the absence of leucine, and selection for 
growth of these strains in the absence of leucine 
allows isolation of colonies transformed with a 
plasmid that contains a gene that complements 
leuB, 

For experiments that involve introduction of 
plasmid vectors into systems other than bacte- 
ria, such as yeast or mammalian cells, it is 
usually necessary to select for the presence of 
plasmid DNA in these hosts. Selectable mark- 
ers for yeast and mammalian systems are gen- 
erally distinct from those used in E. coli (see 
Plasmid Vectors for Yeast and Plasmid Vectors 
for Expression in Cultured Mammalian Cells). 
In cases where a vector is needed that will be 
used in both E. coli and another host, it is 
necessary for the plasmid vector to carry select- 
able markers for both hosts. 

CLONING SITE 

Today's plasmid vectors contain a multiple 
cloning site (MCS) or polylinker cloning re- 
gion that can include >20 tandemly arranged 
restriction endonuclease sites. The sites in the 
polylinker are almost always designed to be 
unique within the vector sequence so that cut- 
ting the vector with a restriction endonuclease 
in the polylinker and cloning foreign DNA into 
this site does not disrupt other critical features 
of the vector. The plethora of sites in the 
polylinker ensures that the appropriate enzyme 
sites will be available for cloning most DNA 
fragments, provides unique reference restric- 
tion sites for rapid restriction mapping of the 
insert, and generally allows for a great deal of 
flexibility when manipulating the cloned DNA. 

The sequences that directly flank the 
polylinker site are often useful for manipulation 
or analysis of insert DNA. Many polylinker 
sites are flanked by sequences for which there 
are commercially available complementary oli- 
gonucleotides, for example the M13 reverse, 
-20, and -40 primers, that can be used for 
priming polymerase chain reactions (PCR) or 
DNA sequencing reactions. Such primers are 
useful tools for amplification or sequencing of 
any DNA fragment inserted into the polylinker. 
Some polylinkers are bordered by 8-bp-cutter 
restriction sites, like Not\, These sites occur 
infrequently in DNA and thus allow for the easy 
excision of an intact insert fragment from the 
plasmid vector. 
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' Many plasmid vectors have been developed 
with bacteriophage— SP6, T7, or T3— promot- 
ers flanking the polylinker cloning sites. These 
promoters can be used in vitro or in vivo, if the 
bacteriophage RNA polymerase is also present, 
to produce large quantities of RNA transcripts 
from DNA inserted into the polylinker. 

In order to make it easier to identify plas- 
mids that contain insert DNA, the polylinkers 
of some vectors have been engineered so that 
introduction of DNA into the polylinker results 
in a scorable phenotype. The most common 
example of this is insertion of the polylinker of 
many basic cloning vectors, the pUC series for 
example, into the lacZgcne a fragment. In the 
appropriate genetic background, production of 
the lacZ a fragment allows for formation of an 
active P-galactosidase enzyme which results in 
the formation of blue colonies on Xgal/IPTG 
indicator plates. Cloning into these polylinkers 
prevents production of a functional lacZ a 
fragment, allowing for rapid identification of 
plasmids containing inserts as white colonies 
on Xgal/IPTG indicator plates. Vectors have 
also been developed that allow for direct selec- 
tion of plasmids containing inserts by location 
of the polylinker in the ccdB (control of cell 
death) gene which causes cell death in £. colL 
Disruption of the ccdB gene by introduction of 
an insert into the polylinker allows the cells to 
survive, and thus only recombinants will grow 
under conditions where the ccdB gene is ex- 
pressed. 

CHOOSING A PLASMID VECTOR 

Some of the basic features to consider when 
selecting a plasmid vector include the size of 
the vector, its copy number, the polylinker^ and 
the ability to select and/or screen for inserts. 
Large plasmids, >15 kb, do not transform well 
and frequently give lower DNA yields. Con- 
sider the final size of the vector plus insert when 
planning an experiment and wherever possible 
use smaller vectors. The higher the copy num- 
ber the more vector DNA is produced, but 
high-copy-number vectors may not be applica- 
ble to all situations (see Mechanism of Repli- 
cation and Copy-Number Control). When 
choosing a plasmid vector, consider both what 
sites are present in the polylinker and the order 
of the sites. If it will be necessary to manipulate 
the cloned sequence subsequent to insertion 
into the polylinker, plan ahead to ensure that 
the necessary sites will remain available in the 
polylinker. Selections or screens for identifica- 
tion of recombinant clones are useful for ex- 
periments where cloning efficiency is expected 



to be low or when generation of a large number 
of clones .is necessary. However, for routine 
siibcloning experiments the advent of PCR 
(vNmsj) has made it possible to rapidly screen 
through a large number of transformants to 
identify potential recombinant molecules, ob- 
viating the need for histochemical and genetic 
screening methods. 

The primary factor in choosing a plasmid 
vector is to understand and anticipate the ex- 
periments for which the recombinant clone will 
be used. The specialized functions of plasmid 
vectors are generally the key to selecting the 
correct vector for an experiment. For example, 
completely different types of vectors would be 
used for generating large quantities of DNA, 
expressing a fusion protein in bacteria, or for 
undertaking a two-hybrid screen in yeast. Once 
the type of vector required is determined then 
deciding upon a particular vector is dependent 
on both the details of the ancillary vector fea- 
tures, for example the type of promoter used to 
express a recombinant protein, and the proper- 
ties of the replicator, polylinker, and selectable 
marker. 

PLASMID VECTORS FOR 
PRODUCTION OF 
SINGLE-STRANDED DNA 

Plasmids have been developed that contain 
a filamentous phage origin of replication in 
addition to a plasmid ori. These "phagemid" 
vectors (unitj.m) can be grown and propagated 
as plasmids. However, upon super-infection of 
a plasmid-containing cell with a wild-type 
helper phage, the phage ori becomes active, and 
single-stranded DNA (ssDNA) is produced and 
secreted. There are usually (+) and (-) versions 
of these vectors where the phage ori is in 
opposite orientation so that it is possible to 
produce ssDNA from either DNA strand. For 
many years, ssDNA was the substrate of choice 
for DNA sequencing (uNrr 7.4) and oligonu- 
cleotide-directed mutagenesis (units, i). Devel- 
opment of sequencing and mutagenic protocols 
that use double-stranded templates has made 
the production of ssDNA a less frequently util- 
ized feature of plasmid vectors. The 
pBluescripa, pBluescriptll, and pBS phagemid 
vectors derived from the general cloning vector 
pUC19 (Fig. 1 .5.2) are examples of phagemids 
that incorporate the f 1 filamentous phage ori. 

PLASMID VECTORS FOR 
CLONING LARGE INSERTS 

Cosmid vectors, plasmids carrying a lambda 
phage cos site (e.g., pWE15, Fig. 1.5.3). were 
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Figure 1 .5.2 Map of pUC1 9. 
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Figure 1.5.3 Map of pWE1 5 (adapted from Wahl et al., 1987. with permission). 
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developed to facilitate cloning of large DNA 
fragments (unit uo), Cosmids can be trans- 
formed into cells like plasmids and once in the 
cells, replicate using their plasmid orL The 
CoIEl type replicators are the most commonly 
used in these vectors, and cosmids can gener- 
ally be maintained at high-copy-number in E. 
colL Cosmids can also be packaged into lambda 
phage heads. In order for packaging to occur 
the cos sites must be separated by 40 to 50 kb, 
the approximate length of the wild-type lambda 
genome. Many cosmid vectors are between 5 
to 10 kb in size and therefore can accept inserts 
of 30 to 45 kb. Because of the size of inserts 
they can accept, the high efficiency of packag- 
ing recombinant molecules into phage, and the 
efficiency of infection of cosmid-containing 
phage heads, cosmid vectors are frequently 
used for making genomic libraries. Unfortu- 
nately, propagation of insert-containing cosmid 
vectors in E. coli sometimes results in the de- 
letion of all or a portion of the insert. To address 
this problem, a new set of cosmid vectors have 
been developed that replace the ColEl replica- 
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tor with the F factor replicator. These "fosmid" 
vectors are maintained at low-copy-number, 1 
to 2 copies per cell, and are more stable than 
higher-copy-number cosmid vectors when 
grown in E. coli (Kim et al., 1992). 

Another type of plasmid cloning vector, 
called bacterial artificial chromosome (BAG; 
Fig. L5.4), has been developed using the F 
factor replicator for propagation of very large 
pieces of DNA (100 to 500 kb). The vectors are 
used in a similar manner to yeast artificial 
chromosome (YAC) vectors but have the ad- 
vantage of being manipulated solely in E. coli. 

PLASMID VECTORS FOR 
EXPRESSION OF LARGE 
QUANTITIES OF RECOMBINANT 
PROTEINS 

There are a wide variety of vectors for ex- 
pressing high levels of recombinant proteins in 
E. coli, insect, and mammalian cells (Chapter 
16). The general goal of expressing proteins in 
any of these systems is to produce large quan- 
tities of a particular protein upon demand. 



Escherichia coli, 
Plasmids, and 
Bacteriophages 

1.5.7 

Supplement 41 



Introduction to 
Plasmid Biology 

1.5.8 

Supplement 41 



While the features of the protein expression 
systems vary considerably, the basic properties 
outlined for expressing proteins in E. coli are 
common to all of them. Expression vectors are 
usually designed such that production of the 
foreign protein is tightly regulated. This is nec- 
essary because the host cellular machinery is 
co-opted to produce large quantities of the for- 
eign protein, the shear amount of which may 
be toxic to the cell, and/or the foreign protein 
may encode a function that will inhibit cell 
growth or kill the host cell. Generally, expres- 
sion vectors are configured so the polylinker 
cloning site is downstream of an inducible pro- 
moter. One of the promoters commonly used in 
E. coli expression vectors is the hybrid trp/lac 
promoter (trc) which contains the lacO operator 
site. This promoter is turned off in the presence 
of the lacP repressor; the repressor gene is 
either carried by the bacterial host or is encoded 
on the expression vector itself. Expression of 
the foreign protein from the trc promoter is 
induced by the addition of IPTG. 

The quantity of foreign protein produced 
will be determined by both the rate of transcrip- 
tion of the gene and the efficiency of translation 
of the mRNA. Therefore, in addition to regu- 
lated highly inducible promoters, many £. coli 
protein expression vectors are designed to op- 
timize translation of the foreign protein in bac- 
terial cells. These expression vectors include an 
efficient ribosome binding site and an ATG start 
codon uptsream of the polylinker cloning site. 
Usually the cloning sites in the polylinker are 
designed so that it is possible to make an in- 
frame fusion to the protein of interest in all three 
reading frames. 

For many experiments, high levels of pure 
recombinant protein are required, and some 
expression vectors are designed to create 
tagged or fusion proteins that facilitate purifi- 
cation of the recombinant protein. Tag se- 
quences may be located 5' or 3' to the poly- 
linker, creating either amino- or carboxy-termi- 
nal-tagged fusion proteins. Six polyhistidine 
residues (vnit io.jjb), the FLAG epitope, and 
glutathione-S-transferase protein (unitj6.7) are 
some of the sequences that are appended to 
proteins to assist in purification. Tagged or 
fusion proteins can be rapidly and efficiently 
purified using an appropriate affinity column 
designed to tighdy bind the tag or fusion region. 
Many protein expression vectors are created 
with specific protein cleavage sites adjacent to 
the tag or fusion sequences to allow for removal 
of these sequences from the purified protein. 
This feature may prove to be essential if the tag 



or fusion sequence impairs the function of the 
protein in the relevant assays. 

PLASMID VECTORS FOR 
REPORTER GENE FUSIONS 

Plasmid vectors have been designed to sim- 
plify the construction and manipulation of re- 
porter gene fusions, where a promoter of inter- 
est is used to drive an easily scored marker gene 
(uNrr 9.6). Gene fusions provide a rapid and 
simple method for following the expression 
pattern conferred by a particular promoter. 
There are generally two types of reporter fu- 
sions, transcriptional and translational fusions. 
For transcriptional fusions, the ATG start codon 
is provided by the marker gene. In translational 
fusions, the 5' untranslated region and ATG are 
provided by the gene of interest; in fact these 
constructs may fuse a large portion, or even 
entire coding region, to the amino terminus of 
a marker. In the reporter vector, the polylinker 
cloning site is located directly upstream of the 
reporter gene for insertion of the promoter 
fragment. In vectors that are designed for ex- 
pression in eukaryotic cells, a polyadenylation 
signal is located downstream of the reporter 
gene. There are a variety of reporter genes used 
including chloramphenicol acetyltransferase, 
luciferase, P-galactosidase, secreted alkaline 
phosphatase, human growth hormone, P-glu- 
curonidase, and green fluorescent protein (also 
see UNITS 9.6-9.7C). An example of a reporter 
vector pEGFP, is shown in Figure 1.5.5. The 
choice of reporter gene will depend on the cell 
type or organism, whether the assay will be 
done in vivo or in vitro, and whether quantita- 
tive or qualitative data is desired . The replicator, 
selectable marker, and other elements of the 
reporter vector will also have to be compatible 
with the system. 

PLASMID VECTORS FOR YEAST 

Yeast plasmid vectors contain the same ba- 
sic features as £. coli vectors — replicator, se- 
lectable marker, and cloning site (unit ua). 
There are two primary types of replicators used 
in yeast plasmid vectors, autonomously repli- 
cating sequences (ARS) derived from the yeast 
chromosomes and the natural yeast 2\im plas- 
mid replicator. ARS-containing piasmids fre- 
quently contain yeast centromeric sequences to 
ensure their stable maintenance in the popula- 
tion of cells. Both ARS and 2\xm vectors aver- 
age 1 0 to 30 copies per cell . Many yeast vectors 
are "shuttle" vectors that can be maintained in 
both S. cerevisae and E. coli (e.g., pRS303; see 
Fig. 1 .5.6). These vectors have an E. coli plas- 
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mid replicator, frequently pMBl derived, and 
a yeast replicator Conversely, the yeast inte- 
grating plasmid vectors, used for introduction 
of genes into the yeast chromosome, have a 
bacterial replicator but no yeast replicator. 

The selectable markers in yeast are for the 
most part recessive markers, usually cloned 
yeast genes that are used to complement muta- 



tions in a biosynthetic pathway. For example, 
• the URA3 marker carried on a plasmid is used 
to restore the ability to grow in the absence of 
uracil to a uraS" yeast. Therefore, yeast select- 
able markers, unlike most bacterial selectable 
markers, are strain dependent. 

There are a wide variety of yeast vectors 
available that have a range of specialized func- 
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Figure 1.5,6 Map of pRS303 (adapted from Sikorski and Heiter, 1989, with permission). 



tions — e.g., expression of recombinant pro- 
teins in yeast, integration of sequences into the 
yeast genome, and cloning of very large frag- 
ments (hundreds of kilobases) of genomic 
DNA. 

PLASMID VECTORS FOR 
EXPRESSION IN CULTURED 
MAMMALIAN CELLS 

The type of vector that is used in mammalian 
cells depends on whether the experiment in- 
volves transient transfection into mammalian 
cells or the generation of stable mammalian cell 
lines carrying the construct of interest (units 
9.1-9.5). Virtually any plasmid vector that con- 
tains an appropriate construct for expression in 
mammalian cells can be used in transient as- 
says. In transient assays, the plasmid vector 
carrying the DNA of interest is transfected into 
mammalian cells, the cells are harvested some 
time later (24 to 96 hr), and the pertinent assay 



is performed. It is not necessary for plasmid 
vectors used in these assays to carry a mammal- 
ian selectable marker or to replicate in mam- 
malian cells; therefore easily manipulatable 
bacterial plasmids, like pUC, are usually the 
vectors of choice. 

Plasmid vectors carrying a selectable 
marker that functions in mammalian cells are 
necessary for the generation of stable trans- 
genic lines (UNJTS 9.5; e.g., pcDNAS.l, Fig. 
1 .5.7). In order to generate a stable mammalian 
cell line, the plasmid DNA is transfected into 
the mammalian cells, and over a period of 
several weeks the DNA of interest is selected 
for based on expression of the vector-borne 
marker. Many mammalian vectors cannot rep- 
licate in mammalian cells, and the only way to 
maintain the DNA of interest and the selectable 
marker is for the vector to randomly integrate 
into the mammalian genome. However, there 
are some plasmid vectors that carry the simian 
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virus (SV40) or bovine papilloma virus (BPV) 
ori that can replicate in mammalian cells if the 
necessary viral replication proteins are pro- 
vided; in most cases the viral replication pro- 
teins must be provided by the host cell line, 
limiting the range of cell types in which these 
vectors are useful. 

The most critical general feature of plasmid 
vectors (and also viral vectors) for creation of 
stable mammalian cell lines is the selectable 
marker (see UNtr 9.5). Unlike transient assays, 
formation and maintenance of stable cell lines 
requires selection. The number of selectable 
markers available for mammalian cells is lim- 
ited; resistance to hygromycin, puromycin, 
G418, and neomycin are the predominant 
markers used. Since expression of foreign pro- 
teins or assaying the expression of a mammal- 
ian gene may require multiple plasmids and/or 
integration of constructs into the mammalian 
chromosome, careful planning must take place 
to ensure that all constructs can be selected for 
with the limited number markers. Further, since 
selection may be necessary over a long period 
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of time— weeks for generation of the lines and 
years for their maintenance— it is important to 
recognize that some of the antibiotics used in 
selection are very expensive and thus cost may 
be a factor in experimental design. 

Specialized plasmid vectors are also used 
for production of viruses that can infect mam- 
malian cells. Plasmid vectors have been de- 
signed to produce infectious retroviral particles 
when transfected into the appropriate packag- 
ing cell line (unit 9.9). These retroviral vectors 
are then used to create stable transgenic lines 
in mammalian cell types not amenable to trans- 
fection. In addition, plasmid vectors have been 
designed to allow easy insertion of DNA se- 
quences into vaccinia virus for the purpose of 
creating recombinant viruses that overexpress 
recombinant proteins (t/Af/r 76.75). 

PLASMID VECTORS FOR 
NON-E. COLI BACTERIA 

Three features required of all bacterial plas- 
mid vectors are that they replicate (unless they 
are suicide vectors), carry a selectable marker. 
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Figure 1.5.8 Map of pRR54 (adapted from Roberts et al., 1990. with permission). 



and can be easily introduced into host cells. The 
first thing to consider when selecting a plasmid 
vector for use in a non-E. coli host is whether 
or not it can replicate and be stably maintained 
in the particular strain. Different plasmid rep- 
licators have different host ranges, some have 
a narrow host range and can only replicate in a 
specific strain while others are promiscuous 
and can replicate in a wide variety of host (e.g., 
pRR54, Fig. 1.5.8). Unfortunately, the ColEl- 
type replicators have a narrow host range, thus 
many standard £. coli vectors cannot be main- 
tained in other bacteria. However, a number of 
broad-host-range replicators, such as RK2 and 
RSFlOlO, have been well characterized and 
used to construct vectors that can replicate in 
many gram-negative bacterial species (and in 
the case of RSFlOlO, some gram-positive spe- 
cies as well). 

Antibiotic resistance genes are used as se- 
lectable markers in non-E. coli bacterial hosts; 
however, the quantity of an antibiotic used to 
select against non-plasmid-containing cells is 
usually higher than the quantity used for E. coli 
selection. Furthermore, some bacterial strains 
are inherently resistant to particular antibiotics, 
thus it is important to determine whether the 
selectable marker carried on a plasmid vector 
is functional in a particular strain. 



The favored method for introduction of plas- 
mid DNA into bacterial host cells varies widely 
with the bacterial strain. Unlike E. coli, many 
bacteria cannot be efficiendy transformed by 
chemical procedures or electroporation. In 
these cases bacterial mating is used to introduce 
plasmid DNA into the desired bacterial host. 
Mating to transfer a plasmid vector from an £. 
coli host where the vector is maintained and 
• manipulated to a recipient bacterium requires 
both cis- and rran^-acting functions. The tra (or 
mob) genes encode the fran^-acting proteins 
necessary to transfer the plasmid DNA from 
one bacterium to another, and they are usually 
located on a helper plasmid that is distinct from 
the plasmid vector. Any plasmid vector that is 
to be mobilized must contain the cw-acdng site 
called oriT where the DNA is cleaved and 
transfer is initiated. 

MAPS OFPLASMIDS 

Figures 1.5.2-1.5.11 present maps of plas- 
mids that are in widespread use or are examples 
of plasmids whose special functions make them 
useful for particular techniques described in 
this manual. Note that the trend in development 
of vectors is to include multiple features on a 
single vector, and many of these examples span 
the vector categories described. 



1.5.12 

Supplement 41 



Current Protocols in Molecular Biology 



EcoR\ 4361 C/al 23 

\ / H/ndlll29 



Aam 4286 
Sspl4170 



Seal 3846 
Pvu\ 3735 
Psrt 3609 

Ppal 3435 



/ EcoRV185 
A//7el 229 

BamHl 375 
Sp/7l 562 
Sail 651 




/^fflll 2475 
A/c/el 2297 



Eagl 939 
A/rt/l 972 
SspM1 1063 



B$m\ 1353 
SfKl1369 
Aval 1425 
ea/l 1444 
BspMI1 1664 
PvuW 2066 



r/rtl 111 2219 



Figure 1 .5.9 Map of pBR322 (Bolivar et al. 1 977; sequence in Sutcllffe, 1 978). 



pBR322 is one of the classic cloning vectors 
from which many other vectors are derived. It 
contains an amplifiable pMBl replicator and 
genes encoding resistance to ampicillin and 
tetracycline. Insertion of DNA into a restriction 
site in either drug-resistance gene usually inac- 
tivates it and allows colonies bearing plasmids 
with such insertions to be identified by their 
inability to grow on medium with that antibiotic 
(see Fig. 1,5.9; Bolivar et al., 1977; sequence 
inSutcliffe, 1978). 

pUC 1 9 belongs to a family of plasmid vec- 
tors that contains a polylinker inserted within 
the alpha region of the /acZ gene. Thepolylink- 
ers are the same as those used in the ml3mp 
series (Fig. 1.14.2). pUC19 and pUC18 have 
the same polylinker but in opposite orienta- 
tions. Under appropriate conditions (see unit ia 
for a description), colonies that bear plasmids 
containing a fragment inserted into the 
polylinker form white colonies instead of blue 
ones. These pMBl -derived plasmids (see Fig. 
1.5.2) maintain a very high-copy-number 
(1000 to 3000 per genome). Wild- type and 
recombinant plasmids confer ampicillin resis- 
tance and can be amplified with chlorampheni- 
col (Norrander et al., 1983). In addition wild- 
type plasmids confer a LacZ+ phenotype to 
appropriate cells (e.g., JMlOl cells, UNm.4). 

pBIuescript is a commonly used phagemid 
cloning vector that contains a polylinker in- 



serted into the alpha region of the lacZ gene and 
T3 and T7 promoter sequences flanking the 
cloning sites. The fl (+) filamentous phage 
origin of replication in pBIuescript SK+ allows 
for the recovery of the sense strand of the lacZ 
gene as ssDNA; the pBIuescript SK(-) vector 
(Fig. 1.5.10) with the fl origin in the opposite 
orientation, fl(-), facilitates recovery of the 
other strand. The position of the polylinker in 
the alpha region of the lacZ gene allows for 
identification of inserts based on a blue/white 
color screen under the appropriate conditions. 
The T3 and T7 promoters are recognized by 
bacteriophage RNA polymerases. Transcrip- 
tion from these promoters reads into the 
polylinker from either sidc.RNA transcripts of 
any DNA cloned into the polylinker can thus 
be produced by run-off transcription in vitro. 

pWE15 is an example of a cosmid vector 
used for cloning DNA fragments -35 to 45 kb 
(see Fig. 1 .5.3; Wahl et al., 1987). The cos sites 
allow the DNA to be cut and packaged into 
phage heads by the appropriate lambda pro- 
teins. There is a single unique Bamm cloning 
site flanked by T3 and T7 promoter sequences. 
These promoters are particularly useful for pro- 
duction of labeled RNA probes corresponding 
to the ends of the insert DNA, and these can be 
used to identify overlapping cosmids for chro- 
mosome walking and construction of cosmid 
contigs. Not\ sites flanking the cloning site can 
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Figure 1 .5.1 1 Map of pTrc99A,B,C. 



potentially be used to excise an intact insert 
fragment from the vector. The ColEl -derived 
ori and ampicillin resistance gene allow for 
replication and selection in bacteria. The SV40 
promoter (included in SV40 ori) which drives 
the neomycin phosphotransferase gene enables 
selection in eukaryotic cells. 

pBeloBACl 1 is an example of the family of 
bacterial-artificial-chromosome vectors based 
on the low-copy-number F factor replicator 
(see Fig. L5.4; Shizuya et. al., 1992). BAG 
vectors are used for cloning large DNA frag- 
ments (100 to 500 kb) in E. coli and are used 
commonly in genome-mapping strategies. 
oris, repE, parA, parB, and parC genes are the 
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essential genes that compose the F factor rep- 
licator, oris and repE genes are required for 
unidirectional replication of the plasmid, and 
parABC loci stably maintain the copy number 
at one to two per £. coli genome. There are two 
unique cloning sites {Hindlll and fiamHI) in- 
serted into the lacZ alpha region. Other useful 
features of the cloning region are (1) T7 and 
SP6 promoter sequences flanking the cloning 
sites, (2) Nqtl restriction sites flanking the clon- 
ing sites for potential excision of the insert, and 
(3) and presence of the loxP and cosN sites that 
can be cleaved by specific enzymes. The ends 
generated by cleavage at loxP or cosN can be 
used as fixed reference points in building an 
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ordered restriction map by efid labeling and 
partial restriction digestion. 

pEGFP-1 is a selectable vector for monitor- 
ing promoter activity in mammalian cells via 
fluorescence of a green fluorescent protein 
(GFP) derivative (Clontech; see Fig. 1.5.5; 
Yang et al., 1996). The vector contains a neo- 
mycin resistance gene downstream of the S V40 
early promoter for selection of stably trans- 
formed mammalian cells. It has a polylinker 
located upstream of the EGFP gene, so that the 
function of promoter sequences introduced into 
the polylinker can be assessed based on EGFP 
activity. The EGFP gene is modified from wild- 
type GFP to ensure expression in mammalian 
cells, it has silent base mutations that corre- 
spond to human codon-usage preferences, and 
sequences flanking the coding region have been 
converted to a Kozak consensus translation 
initiation signal. The vector backbone contains 
an f 1 ori for production of ssDNA, a pUC-de- 
rived ori for propagation in E, coli and a 
kanamycin resistance gene for selection in bac- 
teria. 

A series of yeast shuttle vectors (pRS304, 
305, and 306) has been created to facilitate 
manipulation of DNA in Saccharomyces cere- 
visiae (see Fig. 1.5.6; Sikorski and Hieter, 
1989). These vectors have a backbone derived 
from pBLUESCRIPT into which the features 
necessary for replication and maintenance in 
yeast have been introduced. The members of 
this series of plasmids differ only in the yeast 
selectable marker incorporated; pRS303 car- 
ries the HISS marker that complements a non- 
reverting his3 chromosomal mutation in spe- 
cific yeast strains. These plasmids contain an 
autonomously replicating sequence as well as 
a centromere sequence, CEN6, that ensures 
stable maintenance in yeast cells. 

pcDNA3.1 is a selectable cloning and ex- 
pression vector for use in mammalian cells. The 
features of this vector include a neomycin re- 
sistance gene driven by the SV40 early pro- 
moter (contained within the SV40 ori) and 
terminated by an SV40 polyadenylation signal 
for selection in mammalian cells (see Fig. 
1.5.7). In addition, due to the inclusion of the 
S V40 ori, the vector can replicate as an episome 
in cells expressing the SV40 large T antigen. 
The polylinker cloning site is located down- 
stream of strong cytomegalovirus enhancer- 
promoter sequences and upstream of the bovine 
growth hormone gene termination signals for 
high-level expression of protein-coding se- 
quences cloned into this vector. This vector also 
contains some of the more standard features of 



other plasmid vectors, including a ColEl rep- 
licator for propagation in E. coli, the ampicillin 
resistance gene for selection in E, coli, the f 1 
ori for production of ssDNA, and the T7 pro- 
moter sequence for in vitro transcription of 
DNA inserted into the polylinker. 

pRR54 is an example of a broad-host-range 
mobilizable plasmid vector. This vector con- 
tains replicator and stablilization sequences de- 
rived from the natural RK2 broad-host-range 
plasmid (see Fig. 1.5.8; Roberts et al., 1990). 
oriV h the vegetative origin of replication, trfA 
encodes trans-diCimg functions necessary for 
replication, and par encodes a locus that en- 
hances stability of the plasmid. This plasmid 
can be mated into diverse gram-negative spe- 
cies as long as the appropriate mobilization 
machinery is provided in trans because it con- 
tains the origin of conjugal transfer, oriT, The 
plasmid carries the P-lactamase gene, allowing 
for ampicillin/carbenecillin selection of plas- 
mid containing bacteria. 

The pTrc series of plasmid expression vec- 
tors facilitates regulated expression of genes in 
E, coli. These vectors carry the strong hybrid 
trp/lac promoter, the lacZ ribosome-binding 
site (RBS), the MCS of pUC18 that allows 
insertion in three reading frames, and the rrnB 
transcription terminators (see polylinker se- 
quences given below the vector diagram in Fig. 
1.5.11). These vectors are equally useful for 
expression of unfused proteins (resulting from 
insertion in the Ncol site) or for expression of 
fusion proteins (using one of the cloning sites 
in the correct translational frame). The presence 
of the lacl^ allele on the plasmid ensures com- 
plete repression of the hybrid trp/lac promoter 
during cloning and growth in any host strain 
(see Amann et al., 1988, for further details). 
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